Effective balancing error and user effort in interactive handwriting recognition
نویسندگان
چکیده
Transcription of handwritten text documents is an expensive and timeconsuming task. Unfortunately, the accuracy of current state-of-the-art handwriting recognition systems cannot guarantee fully-automatic high quality transcriptions, so we need to revert to the computer assisted approach. Although this approach reduces the user effort needed to transcribe a given document, the transcription of handwriting text documents still requires complete manual supervision. An especially appealing scenario is the interactive transcription of handwriting documents, in which the user defines the amount of errors that can be tolerated in the final transcribed document. Under this scenario, the transcription of a handwriting text document could be obtained efficiently, supervising only a certain number of incorrectly recognised words. In this work, we develop a new method for predicting the error rate in a block of automatically recognised words, and estimate how much effort is required to correct a transcription to a certain user-defined error rate. The proposed method is included in an interactive approach to tranTel: (+34) 963877350 + 73533 Fax: (+34) 963877359 Email address: {nserrano,jcivera,josanna,ajuan}@dsic.upv.es (N. Serrano, J. Civera, A. Sanchis and A. Juan) Preprint submitted to Pattern Recognition Letters May 18, 2015 scribing handwritten text documents, which efficiently employs user interactions by means of active and semi-supervised learning techniques, along with a hypothesis recomputation algorithm based on constrained Viterbi search. Transcription results, in terms of trade-off between user effort and transcription accuracy, are reported for two real handwritten documents, and prove the effectiveness of the proposed approach.
منابع مشابه
Combining Neural Networks and Context-Driven Search for On-line, Printed Handwriting Recognition in the Newton
MESSAGEPAD and EMATE. Combining an artificial neural network (ANN) as a character classifier with a context-driven search over segmentation and word-recognition hypotheses provides an effective recognition system. Long-standing issues relative to training, generalization, segmentation, models of context, probabilistic formalisms, and so on, need to be resolved, however, to achieve excellent per...
متن کاملError Repair in Human Handwriting - An Intelligent User Interface for Automatic On-Line Handwriting Recognition
Several important factors, such as recognition accuracy, user acceptance, and system usability, have to be considered in designing an interface of a handwriting recognition system. Since both users and recognition algorithms make mistakes, it is desirable for the user interface of a handwriting recognition system to have mechanisms recovering from errors. In this paper we address the problem of...
متن کاملConfidence Measures for Error Correction in Interactive Transcription Handwritten Text
An effective approach to transcribe old text documents is to follow an interactive-predictive paradigm in which both, the system is guided by the human supervisor, and the supervisor is assisted by the system to complete the transcription task as efficiently as possible. In this paper, we focus on a particular system prototype called GIDOC, which can be seen as a first attempt to provide user-f...
متن کاملContext-Aware Gestures for Mixed-Initiative Text Editing UIs
This work is focused on enhancing highly interactive text-editing applications with gestures. Concretely, we study CATTI, a handwriting transcription system that follows a corrective feedback paradigm, where both the user and the system collaborate efficiently to produce a high-quality text transcription. CATTI-like applications demand fast and accurate gesture recognition, for which we observe...
متن کاملPreprocessing and Feature Extraction Techniques for Multimodal Interactive Transcription of Text Images
To date, automatic handwriting recognition systems are far from being perfect and heavy human intervention is often required to check and correct the results of such systems. This “post-editing” process is both inefficient and uncomfortable to the user. An example is the transcription of historic documents: State-of-the-art handwritten text recognition technology is not suitable to perform this...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pattern Recognition Letters
دوره 37 شماره
صفحات -
تاریخ انتشار 2014